FP-Rank: An Effective Ranking Approach Based on Frequent Pattern Analysis
نویسندگان
چکیده
Ranking documents in terms of their relevance to a given query is fundamental to many real-life applications such as document retrieval and recommendation systems. Extensive studies in this area have focused on developing efficient ranking models. While ranking models are usually trained based on given training datasets, besides model training algorithms, the quality of the document features selected for model training also plays a very important aspect on the model performance. The main objective of this paper is to present an approach to discover “significant” document features for learning to rank (LTR) problem. We conduct a systematic exploration of frequent pattern-based ranking. First, we formally analyze the effectiveness of frequent patterns for ranking. Combined features, which constitute a large portion of frequent patterns, perform better than single features in terms of capturing rich underlying semantics of the documents and hence provide good feature candidates for ranking. Based on our analysis, we propose a new ranking approach called FP-Rank. Essentially, FP-Rank adopts frequent pattern mining algorithms to mine frequent patterns, and then a new pattern selection algorithm is adopted to select a set of patterns with high overall significance and low redundancy. Our experiments on the real datasets confirm that, by incorporating effective frequent patterns to train a ranking model, such as RankSVM, the performance of the ranking model can be substantially improved.
منابع مشابه
An approach to rank efficient DMUs in DEA based on combining Manhattan and infinity norms
In many applications, discrimination among decision making units (DMUs) is a problematic technical task procedure to decision makers in data envelopment analysis (DEA). The DEA models unable to discriminate between extremely efficient DMUs. Hence, there is a growing interest in improving discrimination power in DEA yet. The aim of this paper is ranking extreme efficient DMUs in DEA based on exp...
متن کاملA new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining
Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...
متن کاملFP-Viz: Visual Frequent Pattern Mining
Frequent pattern mining plays an essential role in many data analysis tasks including association-, correlation-, and causality analysis and has broad applications. Examples are market basket analysis and web click stream analysis. Although a number of efficient methods for mining frequent patterns where proposed in the past, there exist only a small number of visual exploration tools for disco...
متن کاملThe Presentation of an Approach of Evaluation and Ranking in Data Envelopment Analysis with Interval Data: a Case Study in the Evaluation and Ranking of Iran’s Provinces in the Health and Treatment Sector
Today, in every society the health and treatment sector are among the most important service sectors. Therefore, it is crucial that their performance be evaluated and examined. Although the researchers have proposed many different approaches to evaluate and rank the health sectors, no precise approach for evaluating and ranking have been reported up to now. Assessing the coefficient of variatio...
متن کاملA two phases approach for discriminating efficient candidate by using DEA inspired procedure
There are several methods to ranking DMUs in Data Envelopment Analysis (DEA) and candidates in voting system. This paper proposes a new two phases method based on DEA’s concepts. The first phase presents an aspiration rank for each candidate and second phase propose final ranking.
متن کامل